ProphetMT: A Tree-based SMT-driven Controlled Language Authoring/Post-Editing Tool
Abstract
This paper presents ProphetMT, a tree-based SMT-driven Controlled Language (CL) authoring and post-editing tool. ProphetMT takes the source-side rules of a translation model and offers them to users as auto-suggestions. In this sense, users write in a ‘Controlled Language’ that is ‘understood’ by the computer. ProphetMT also lets users attach structural information as they compose content. When a specific rule is selected, a partial translation is generated on the fly with the help of this structural information. Our English-to-Chinese experiments show that ProphetMT not only better regularises an author’s writing behaviour, but also significantly improves translation fluency, which is vital for reducing post-editing time. Additionally, once writing and translation are complete, ProphetMT provides a colour scheme that explicitly highlights the relations between source and target rules, further improving post-editor productivity.
Similar resources
An SMT-driven Authoring Tool
This paper presents a tool for assisting users in composing texts in a language they do not know. While Machine Translation (MT) is pretty useful for understanding texts in an unfamiliar language, current MT technology has yet to reach the stage where it can be used reliably without a post-editing step. This work attempts to make a step towards achieving this goal. We propose a tool that provid...
RoundTrip Ontology Authoring
Controlled Language (CL) for Ontology Editing tools offer an attractive alternative to naive users wishing to create ontologies. However, despite the benefits of CLs, users are still required to learn the correct syntactic structures in order to use the Controlled Language properly. Referring to CL vocabulary/syntax rules can be time-consuming and annoying, and in certain cases may prevent uptake o...
An Empirical Study of Segment Prioritization for Incrementally Retrained Post-Editing-Based SMT
Post-editing the output of a statistical machine translation (SMT) system to obtain high-quality translation has become an increasingly common application of SMT, which henceforth we refer to as post-editing-based SMT (PE-SMT). PE-SMT is often deployed as an incrementally retrained system that can learn knowledge from human post-editing outputs as early as possible to augment the SMT models to ...
Depfix, a Tool for Automatic Rule-based Post-editing of SMT
We present Depfix, an open-source system for automatic post-editing of phrase-based machine translation outputs. Depfix employs a range of natural language processing tools to obtain analyses of the input sentences, and uses a set of rules to correct common or serious errors in machine translation outputs. Depfix is currently implemented only for English-to-Czech translation direction, but exte...